Human and computer learning: An experimental study 
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Simple memorizing tasks have been chosen such as a binary code on a 5 x 5 matrix. After the 
establishment of an appropriate protocol, the codified matrices were individually presented to 150 
university students (conveniently pre-selected) who had to memorize them. Multiple presentations 
were offered seeking perfect performance verified through the correct reproduction of the code. We 
measured the individual percentual error as a function of the number of successive presentations, 
and then averaged over the examined population. The learning curve thus obtained decreases 
(almost monotonically) until becoming virtually zero when the number of presentations attains 
six. A computer simulation for a similar task is available which uses a two-level perceptron on 
which an algorithm was implemented allowing for some degree of globality or nonlocahty (technically 
referred to as entropic nonextensivity within a current generalization of the usual, Boltzmann-Gibbs, 
statistical mechanics). The degree of nonextensivity is characterized by an index q, such that 
q — 1 recovers the usual, extensive, statistical mechanics, whereas q ^ 1 implies some degree of 
nonextensivity. In other words, q — 1 is a (very sensitive) measure of globality (gestalt perception 
or learning). The computer curves fit well the human result for q ~ 1.02. ft has been verified that 
even extremely small departures of q from unity lead to strong differences in the learning curve. Our 
main observation is that, for the very specific learning task on which we focus here, humans perform 
similarly to slightly nonextensive perceptrons. In addition to this experiment, some preliminary 
studies were done concerning the human learning of ambiguous images (based on figure-background 
perception) . In spite of the complexity of drawing conclusions from such a comparison, some generic 
trends can be established. Moreover, the enormous and well known difficulty for computationally 
defining semantic, hierarchic and strategic structures reveals clear-cut differences between human 
and machine learning. 



I. INTRODUCTION 

It is possible to consider that cognitive psychology ap- 
peared as a reaction to behaviorist approaches, where 
the mental content plays the role of a black box [jl 0] • In 
contrast, this content constitutes a central issue in cog- 
nitive sciences. Consequently, the use of computers to 
implement or imitate human intellectual tasks naturally 
emerged as a methodological tool, and even as a pow- 
erful metaphor, in the investigation of mental processes 
such as intelligence, learning, memorization, among oth- 
ers. It is along these lines that we prepared a sequence of 
experiments with humans, to be later on compared with 
similar experiments, or simulations, done with computers 
provided with appropriate algorithms, in particular sim- 
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pie perceptrons. It should of course be clear that such 
comparisons have philosophical implications (see, for in- 
stance, 0,0, EL HI) > which we address in the present work 
only in the Conclusions (Section V). 

In the present study, two different memorization tasks 
were implemented. The first of them was relatively sim- 
ple, namely the memorization of simple binary codes in 
5x5 and 7x7 matrices. Most of the present effort is 
dedicated to this analysis. The second task was, intellec- 
tually speaking, sensibly more complex. It consisted of 
learning ambiguous images, where the figure-background 
reversal is crucial. Although semantic and strategic as- 
pects are present in both learning tasks, the second one 
is by far more delicate and reveals higher level cognitive 
phenomena. Consistently, our study leads to quantita- 
tive information concerning the first task, whereas only 
some qualitative features were determined for the latter. 

At the level of the comparison with computational sim- 
ulations, our emphasis is put on whether learning occurs 
in an extensive or a nonextensive manner. These terms 
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will be mathematically defined and further analyzed in 
Section III. They are currently used in statistical mechan- 
ics and thermodynamics, branches of physics dedicated 
to the study of the connections between the microscopic 
and the macroscopic worlds. At the present stage, it 
is enough to think of cxtensivity as a form of nonglob- 
ality or locality 0, Q, as opposed to nonextensivity or 
globality or nonlocality. In a loose manner, they respec- 
tively correspond to the molecular and molar approaches 
in psychology, i.e., the system is perceived as the sum of 
its parts, or as different from the sum of its parts. 

In Section II we present the experimental study with 
humans. In Section III we describe the entropic concepts 
that we use with regard to the computational simulations 
that have been implemented. In Section IV we compare 
both approaches, namely with humans and computers. 
Finally, we conclude in Section V. 



II. THE EXPERIMENT 

The experiment consists in individually learning fixed 
binary codes (represented by the signs + and O) m n x n 
matrices. It was presented to each person, initially one 
5x5 matrix (see Fig. 1) and, then, two 7x7 binary ma- 
trices (see Figs. 2 and 3). In order to avoid any kind of 
uncontrolled cultural effect, the location of the symbols 
+ and O m a U the matrices was randomly generated by 
computer and fixed once for ever. It was carried out with 
approximatively 150 university students whose specialty 
was not directly related to geometry, mathematics or im- 
ages, in order to avoid professional bias. For instance, 
students of physics, mathematics, engineering, architec- 
ture, visual publicity were excluded. The experiment 
population was mainly constituted by students of psy- 
chology, administration, social service and social comu- 
nication of the Federal University of Rio de Janeiro. Ap- 
proximately 30 students were used for preliminary tests 
in order to fix an optimal experimental protocol (matrix 
sizes, binary code of each matrix, exhibition and hidden- 
ing times, among others). Then, precisely 120 (92 female 
and 28 male) students were exposed to the same proto- 
col, and only their results were taken into account for 
quantitative purposes in the statistical processing (aver- 
aging, in particular). Their ages ranged between 17 and 
27 years old, the majority of them being around 22 years 
old. 

Each of them was sequentially isolated in a peaceful 
room and tested, for about 40 minutes, by one of us. In- 
formation about the scope of the research was given to 
each one of the 120 individuals . It was told that the 
experiment was not measuring their intelligence (so that 
they would feel relaxed), and that it was important for 
understanding how learning occurs (so that they would 
seriously try to perform satisfactorily). After these in- 
structions were given to each one of the 120 students, 
three matrices were shown. In all cases, a 5 x 5 matrix 
was first shown, one of the two complementary matrices 



(i.e, obtained by interchanging the symbols + and o). 
Then, one of the four 7x7 matrices was shown (matrix 
in Fig. 2 to 30 students, matrix in Fig. 3 to other 30, and 
so on). Then the other noncomplementary 7x7 matrix 
was shown. The sequence of each individual test was as 
follows: 

(1) after some general explanations, an empty 5x5 ma- 
trix and the O an d + symbols were shown, and the sub- 
jects were asked to randomly fill the matrix with the two 
symbols. This step aimed to provide to the individuals 
some familiarity with the experiment; 

(2) then Fig. 1 was exhibited during 8 seconds and then 
hidden. The student had to try to reproduce it on the 
spot on an empty matrix. When this was done, the op- 
eration was repeated after a rest interval of 10 seconds. 
The 5x5 matrix was never shown more than 10 times 
(the individuals started feeling tired after 10 times). The 
matrix was considered to be learnt if no error was made 
after two successive exhibitions; 

(3) then the learning test was repeated by successively 
using two of the four 7x7 matrix, one at a time; 

(4) finally, the student was asked to briefly describe how 
he(she) proceeded to learn. 

The error €i(t) of the i-th individual (i — 1, 2, 3, 120; 
t = corresponds to the initial random filling) is defined 
as the number of wrong elements of the filled nxn matrix 
(n = 5,7); < ti(t) < n 2 . Typical results are presented 
in Figs. 4, 5 and 6 for the 5x5 and in Figs. 7 and 8 
for one of the four 7x7 matrices (the figures associated 
with the other three 7x7 matrices are quite similar in 
fact). Although in a quite different context, results that 
have some connection with the present ones have been 
exhibited in Q . 

The averages (e){t) = ^E^e^t) ( with N = 120 if 
the entire set of students is used for averaging) are shown 
in Fig. 5. These curves already achieved the aspect pre- 
sented in this figure when the averages were performed 
with approximately 80 individuals. Using the entire set of 
120 results just improved the precision but incorporated 
no new qualitative elements in the curves. If we define 
the learning time as the number of times shown before 
learning, excluding those who did not suceed learning 
that particular matrix untill the end of the experiment, 
we can check that it is of the order of 6. 

Incidentally we verified an interesting (and indeed un- 
expected) cultural phenomenon. For the 5x5 matrix we 
were expecting (e)(0) to be close to 25/2 = 12.5 since, 
before starting to show the codified matrix, we asked to 
randomly fill the empty matrix with symbols o and +, 
indicated in this order above the empty matrix, if we 
look at them from left to right. In variance with this 
reasonable expectation, we found, for the first 60 stu- 
dents, (e)(0) ~ 14. After some hesitation about what 
could be the cause of this asymmetry (e.g., could it be 
the different semantics humans associate with a circle or 
a cross?), we speculated that it could be the fact that 
Portuguese language (within which the Brazilian popu- 
lation is educated) is read from left to right. Then, to 
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the second and last set of 60 students, an empty Fig. 
1 was presented with the symbols in the ordering + 
o instead of o +. Very symptomatically, we then ob- 
tained (e)(0) ~ 11.5, the overall average being conse- 
quently (14+ 11.5)/2 = 12.75, reasonably close to 12.5 
as initially expected! To confirm this cultural cause of 
the observed asymmetry, it would be interesting to re- 
peat the experiment with say arabic students (educated 
within right-to-left reading). 

Another interesting feature that we observed is that, 
for many individuals, (e)(2) > (e)(1), thus systematically 
contradicting the overall monotonic tendency of (e) (t) to 
decrease with time. The reason for this kind of a priori 
unexpected behavior appeared to be (as commented by 
the individuals themselves during the free final conver- 
sation) that, after seeing for the first time the code to 
be learnt, the student dedicated a good part of his (her) 
attention to establish a "strategy" for learning rather to 
properly learn the matrix. Let us mention, by the way, 
that in an experiment like the present one it is quite 
hard to differentiate between learning the strategy and 
memorizing the matrix within that strategy. This kind 
of modelization is supported by the fact that, several 
months after conclusion of the experiment, quite a few 
students still remembered the strategy, while they had 
completely forgotten the particular matrix code itself. 

Let us now briefly comment the second experiment we 
developed. We chose several ambiguous images, all of 
them being susceptible of two mutually excluding inter- 
pretations on the basis of figure-background reversal. For 
example, if one sees in Fig. 15 [9| a young woman, one 
does not simultaneously see the old woman, and recipro- 
cally . We implemented this more complex learning task 
by showing the image and then asking to the individual 
what he sees. Then we tried to count the time needed by 
the person in order to recognize the other image interpre- 
tation. This perception mechanism is sometimes referred 
to as reversal of the figure-background. It turned out that 
the times involved in this type of experiment, and very 
specifically the slowness of learning, if any, how to recog- 
nize the reversal, were so ill-defined that we decided not 
to proceed with this protocol. The question of how to 
conveniently quantify such learning remains, therefore, 
an open question (see also [ToUTHl. 



we are referring to in the present paper. The Boltzmann- 
Gibbs-Shannon (BGS) entropy is the basis of standard 
statistical mechanics and thermodynamics. It is defined 
(in its discrete version) by [T^ 
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where W is the number of microscopic possibilities acces- 
sible to the system, and {pi} are the associated probabil- 
ities C^2i—iPi = 1); for simplicity, we have taken Boltz- 
mann constant equal to unity. This entropy becomes 
maximal at equiprobability, i.e., pi = l/W for all i, and 
achieves the value 
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which is the celebrated Boltzmann formula. 

If we consider a composite system made of two (prob- 
abilistically) independent systems A and B, i.e., if we 
assume that pfj +B = pfpf, and replace this into Eq. 
(1), we straightforwardly obtain 

S BGS (A + B) = S bgs (A) + S BGS (B) , (3) 

which can be phrased as the entropy of the whole is the 
sum of the entropies of the parts. The entropic form (1) 
is the basis of standard, Boltzmann-Gibbs (BG), statisti- 
cal mechanics and thermodynamics, and property (3) is 
known as extensivity or additivity. This entropy, as well 
as others, are in some sense ubiquitous. Indeed, they 
emerge in a great variety of discussions. For example, 
they have often been used concerning complex phenom- 
ena such as the organization of living matter (see, for 
instance, [l^L as well as other types of organization, in- 
cluding that of knowledge (e.g., memorization and learn- 
ing), economics, linguistics, to mention but a few (see, for 
instance, |14L ll5| L Before addressing the generalization 
of Sbgs we are interested in here, let us mention that 
optimization of Eq. (1) in the presence of a constraint of 
the type JZ]^ PiEi = constant (where Ei might be say 
microscopic energy levels), leads to 
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III. ENTROPIC CONCEPTS AND 
COMPUTATIONAL SIMULATIONS 

A great variety of computer learning algorithms are 
available in the literature. It is clear that all of them 
process, in one way or another, information. Entropy is 
well known to be a convenient tool for quantifying (lack 
of) information. It can therefore be used in the context of 
any learning algorithm, at least in principle. We shall use 
it here in connection with the specific perceptron we shall 
describe later on. For convenience, let us briefly review 
at this point some basic notions about the entropic forms 



where (3 is a parameter to be determined through the 
value of the constraint. In statistical mechanics, Eq. (4) 
is in fact the celebrated Boltzmann-Gibbs weight. 

In 1988, one of us (CT) proposed Q3 the generaliza- 
tion of BG statistical mechanics on the basis of a more 
general entropic form, namely 
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q being any real number. We can verify that Si — Sbgs, 
in other words, the BG formalism becomes now the q = 1 
particular case of this more general formalism. If we 



assume, once again, two independent systems A and 
we can prove that 

S q (A + B) = S q (A) + S q (B) + (1 - q)S v (A)S q (B) , 

which can be phrased as the generalized entropy of 
whole is different from the sum of the generalized 
tropies of the parts. This property is referred to as not 
tensivity or nonadditivity. To be more precise, if 
take into account that S q is always zero or positive, 
(6) implies that S q {A + B) > S q (A) + S q (B) if q ■ 
and S g (A + B) < S q {A) + S q (B) if q > 1. Onl 
q = 1 we have that S q (A + B) = S q (A) + S q {B). 
is from property (6) that the terms nonextensive ste 
tical mechanics and thermodynamics have been coi 
(for reviews see [Tsj). Analogously to what we did 
fore, if we optimize 5^ in the presence of the constr; 
Y^A=\V\Eil Y^i=\V\ = constant, we obtain 

Pi oc [1 - (1 - q)f3'E i ] 1 /( 1 -^ , (T 

where, as before, f3' is a parameter to be determinec 
through the value of the constraint. Notice that for th< 
normal regime for /3', i.e., (3 1 > 0, we have, for large 
values oi Ei, a long power-law tail for q > 1, whereai 
we have a short exponential tail for q = 1; finally, foi 
q < 1, we have a cutoff. Eq. (7) can be re- written in th< 
Boltzmann-Gibbs form, namely 
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ln[l - (1 - q)pEi] 
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In other words, in what concerns the optimizing distri 
bution, E\ plays the role of an effective energy which re 
places Ei (in the q — * 1 limit, of course we recover E[ — * 
Ei). This effective energy can be used to q— generalize 
a variety of microscopic and mesoscopic equations. On< 
such example is the Langevin equation, on which the per 
ceptron that we use here has been constructed. 

As we see, the entropic property (6) has a kind o 
gestalt-\ike flavor. Its use constitutes a natural choice i 
we desire to deal with informational phenomena involv 
ing global or nonlocal aspects. Since this might well b< 
the case of human learning, we have adopted this formal 
ism in order to have the possibility of comparing humai 
and machine learnings. 

To do so, a nonextensive perceptron has been imple- 
mented which performs a task similar to the learn 
ing of the 5x5 and 7x7 matrices that were exposed tc 
the students, according to the experimental protocol w< 
described earlier. In order to perform calculations, th< 
perceptron needs an internal dynamical equation. To ful- 
fill this requirement, the ^-generalized Langevin equatioi 
previously introduced by Stariolo was implementec 
in the perceptron. Some typical runs of the perceptron 
are shown in Fig. 4, and typical averages are shown in 
Figs. 16, 17 and 18 (from [ig). 
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FIG. 2: First 7x7 matrix. This matrix is referred to as the 
+diagonal matrix. Its dual matrix is obtained by permutating 
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FIG. 3: Second 7x7 matrix. This matrix is referred to as the 
Ocolumn matrix. Its dual matrix is obtained by permutating 
the + and the O symbols, and is referred to as the -fcolumn 
matrix. 
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FIG. 4: Typical examples of time evolution of the error as- 
sociated with the 5x5 matrix. The three left correspond to 
three different individuals (the abscissa is the number of pre- 
sentations of the matrix; the ordinate is the number of matrix 
elements incorrectly reproduced). The three right (see [16] for 
further details) correspond to three different initial conditions 
for the perceptron (the abscissa is the number of iterations; 
the ordinate is the percentual error). 



IV. COMPARISON OF THE HUMAN AND 
COMPUTER RESULTS 



The purpose of the present section is to compare the 
results obtained with humans and those obtained with 
the nonextensive perceptron. The comparison will be il- 
lustrated on the learning/memorizing of the 5x5 matrix. 
We shall verify that, for this specific task, the human and 
perceptron results can be amazingly similar. This can be 
checked on Fig. 4. The three individual results (on the 
left) are indeed similar to the perceptron realizations with 
three different initial conditions (on the right). The three 
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FIG. 5: Average error associated with the learning of the 
5x5 matrix as a function of the number of presentations: 
circles, squares and triangles respectively correspond to aver- 
aging over 73, 92 and 120 individuals. 



human examples have been chosen as to exhibit typical 
cases. The three perceptron examples have been chosen 
in order to have an overall aspect similar to the human 
ones. 

Averages over many realizations of the data just pre- 
sented are shown (with dots) in Figs. 16 and 18. We have 
rescaled the time variable in such a way that comparison 
becomes possible on the same graph. More precisely, we 
have expressed time in units of the corresponding half- 
time t m , defined as the value of time at which the av- 
erage error curve decays to its half value. A rescaling 
such as this one is clearly necessary in order to quan- 
titatively compare the results. Indeed, human "time" is 
here represented as the number of presentations, whereas 
perceptron "time" essentially corresponds to the number 
of computer iterations. These two numbers being of a 
completely different nature (see also @), it is clear that 
rescaling becomes necessary. 

We may consider Fig. 18 as the central result of the 
present work. We verify that, for the specific task of 
learning/memorizing 25 binary states (on a matrix for 
the humans), humans and machines are remarkably sim- 
ilar. Of course, the parameters of the perceptron have 
been chosen in such a way as to optimize the overall 
fitting to the human data. The number of individuals 
that have been averaged is 120, and we have verified that 
no sensible variation is obtained under increase of that 
number. In the language of statistical mechanics, we may 
say that we have practically attained the thermodynamic 
limit. There is a little bump in the human results at 
t/t m ~ 2: we have not identified its origin (perhaps fa- 
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FIG. 6: Histogram of the probabilities corresponding to the 
5x5 matrix. The abscissa is the ordinal of the presentation 
at which the individual learnt the matrix: 82 (out of 120) 
individuals learnt the matrix before or at the 9th presentation 
(we recall that the 10th presentation was used to confirm the 
learning at the 9th one); 38 individuals did not succeed (and 
are not computed in the histogram) . 



FIG. 7: Average error associated with the learning of the 
7x7 +diagonal matrix as a function of the number of pre- 
sentations: the circles correspond to averaging the results of 
30 individuals to whom the +diagonal matrix was shown in 
first place; the squares correspond to averaging the results of 
30 individuals to whom the -fdiagonal matrix was shown in 
second place (after having seen, in first place, the O commn 
matrix) . 



tigue of the tested individuals, perhaps something else) 
. The perceptron does not exhibit such bump. As we 
see, the perceptron that fits the data has some degree 
of nonextensivity, as was conjectured in the beginning of 
the present work. Although q is quite close to unity, we 
must take into consideration the fact that the error curves 
have been shown to be (see Fig. 17) extremely sensitive 
to the degree of nonextensivity in the neighborhood of 
q = 1 (extensive case). 



V. CONCLUSIONS 

In summary, we have implemented an experimental 
study aiming to measure the learning/memorizing per- 
formance of humans on simple codes on matrices. It is on 
purpose that we simultaneously use the words "learning" 
and "memorization". Indeed, the experiments clearly 
showed that the improvement of correct answers was due 
to a mixture of memorization of the specific codified ex- 
ample and devising learning strategies (symmetry rules 
and other mnemonic tricks) in order to efficiently im- 
plement the memorization effort. In fact, after several 
months, we informally verified that the individuals had 
forgotten the codes, but still remembered the strategy 
they used for memorizing them. 

Through comparison with machine results, we verified 
that this particular human task was executed with clear 
indication of a slight, though very efficient nonextensivity 
(or globality) quantified by the entropic index q. This in- 
dex appeared to be slightly above unity, which character- 
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FIG. 8: Histogram of the probabilities corresponding to 
the first shown 7x7 matrix, either the -l-diagonal or the 
Odiagonal ones. The abscissa is the ordinal of the presenta- 
tion at which the individual learnt the matrix: 50 (out of 60) 
individuals learnt the matrix before or at the 9th presentation 
(we recall that the 10th presentation was used to confirm the 
learning at the 9th one); 10 individuals did not succeed (and 
are not computed in the histogram). 



FIG. 9: The well known ambiguous figure "My Wife and My 
Mother-in-law", created by W.E. Hill in 1915, and originally 
published in Puck. 
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FIG. 10: Typical average error curves for the perceptron (5x5 
matrix) as a function of the number of iterations (T is a pa- 
rameter of the perceptron). The perceptron has been chosen 
with 5 x 5 = 25 binary inputs, in order to simulate the human 
task (dots) as closely as possible on the average (taken on 92 
individuals in this example). See [16] for further details. 



FIG. 11: Typical average error curves for the perceptron (5x5 
matrix) as a function of the number of iterations (T is a pa- 
rameter of the perceptron) . We notice the extreme sensitivity 
to the value of q in the neighborhood of q = 1. See [16] for 
further details. 

izes slower learning/memorizing (the error curves takes 
longer to become basically zero), but perhaps higher abil- 
ity for devising strategies. It is then allowed to conjec- 
ture that human nature evolved, during successive gen- 
erations, not so much to strongly improve the speed as- 
sociated with such kind of memorization, but rather to 
improve the capacity of spontaneously and quickly gen- 
erating intellectual strategies for performing tasks such 
as memorization. 

We also applied a similar experimental protocol for 
learning/memorizing how to analyze complex figure- 
background images in order to quickly realize alternative 
(typically two) interpretations of the figures. We veri- 
fied that, unless much more sophisticated experimental 
protocols and computational algorithms are deviced, cog- 
nitive tasks with important semantic content are by all 
means nontrivial to measure and compare. For such com- 
plex tasks, even more than for the simple binary learn- 
ing/memorization addressed here, the role of strategies 
might well be fundamental, although this remains to be 
proved. On more general grounds, the scenario which 
emerges is that nonextensivity seems to serve to humans 
for achieving abduction, one of Charles Sanders Pierce 
three basic forms of inference (see P3 and references 
therein). In other words, given its intrinsic nonlocal na- 
ture (strong collective correlations are necessary for mak- 
ing the entropic index q to differ from unity), it is plau- 
sible that nonextensivity constitutes the structure neces- 
sary to make metaphors. Given the very high intellectual 
level attributed, since Aristotle, to metaphors \v^, it is 
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allowed to think that it has some specific relation with the 
nature of the one that we might consider as the animal 
who makes metaphors. We may use "homo metaphori- 
cus" to express this concept. Further developments, on 
both philosophical and cognitive-psychological grounds, 
within the frame that we have outlined here would nat- 
urally be very welcome. They could reinforce or exclude 
the interpretation that our present human-machine com- 
parison suggests. 



FIG. 12: Time evolution of the error for the 5x5 matrix, 
where t m is the value of t at which the error becomes half 
of its value at t = 0. The dots correspond to averaging the 
experimental data with 120 humans. The continuous curve 
has been obtained averaging a large number of initial condi- 
tions for the perceptron with the gain parameter g = 0.91, 
the temperature-like parameter T = 0.02 and the entropic in- 
dex q — 1.02. It is with these values that optimal fitting was 
obtained for the experimental data. The parabolic extrapo- 
lation of the experimental data corresponding to t = 1, 2 and 
3 provides, for t — 0, the value of 0.316 for the percentual 
error, which corresponds to 0.316 x 25 = 7.9 for the absolute 
error. Therefore the value of t m corresponds to the value of 
t at which the absolute error equals 7.9/2 = 3.95. Since no 
integer value of t corresponds exactly to this value, a linear 
interpolation has been performed, yielding t m = 4.2 for the 
experimental data with 120 individuals. See [16] for further 
details. 
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